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Abstract 

For  a  directed  graph  G,  a  t-identifying  code  is  a  subset  S  C  V (G) 
with  the  property  that  for  each  vertex  v  €  V (G)  the  set  of  vertices  of 
S  reachable  from  v  by  a  directed  path  of  length  at  most  t  is  both  non¬ 
empty  and  unique.  A  graph  is  called  t-identifiable  if  there  exists  a  t- 
identifying  code.  This  paper  shows  that  the  de  Bruijn  graph  B(d,  n)  is 
t-identifiable  for  n  >  2t— 1,  and  is  not  t-identifiable  for  n  <  2t— 2.  This 
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paper  also  proves  that  a  i-identifying  code  for  t-identifiable  de  Bruijn 
graphs  must  contain  at  least  dn~1(d  —  1)  vertices.  Constructions  are 
given  to  show  that  this  lower  bound  is  achievable  for  f-identifying 
codes  when  n  >  2t.  Further  a  (possibly)  non-optimal  construction  is 
given  when  n  =  2t  —  1.  Additionally,  with  respect  to  B(d,  n )  this  paper 
proves  upper  and  lower  bounds  on  the  size  of  a  minimum  t-dominating 
set  (a  subset  with  the  property  that  every  vertex  is  at  distance  at  most 
t  from  the  subset),  that  the  minimum  size  of  a  directed  resolving  set 
(a  subset  with  the  property  that  every  vertex  of  the  graph  can  be 
distinguished  by  its  directed  distances  to  vertices  of  S )  is  dn~1(d  —  1), 
and  that  if  d  >  n  the  minimum  size  of  a  determining  set  (a  subset  S 
with  the  property  that  the  only  automorphism  that  fixes  S  pointwise 
is  the  trivial  automorphism)  is  |"  • 


1  Introduction 

First  introduced  in  1998  [9],  an  identifying  code  for  a  graph  G  is  a  subset 
S  C  V  ( G )  with  the  property  that  for  each  v  G  V ( G )  the  subset  of  vertices 
of  S  that  are  adjacent  to  v  is  non-empty  and  unique.  That  is,  each  vertex 
of  the  graph  is  uniquely  identifiable  by  the  non-empty  subset  of  vertices  of 
S  to  which  it  is  adjacent.  Note  that  not  all  graphs  have  an  identifying  code; 
those  that  do  are  called  identifiable.  A  graph  fails  to  be  identifiable  if  and 
only  if  it  contains  a  pair  of  vertices  with  the  same  closed  (in-)neighborhood; 
such  vertices  are  called  twins.  Extending  these  definitions  to  t-identifying 
and  t-twins  is  easy  and  is  covered  in  Section  3.  Identifying  codes  can  be 
quite  useful  in  applications.  For  example,  an  identifying  code  in  a  network 
of  smoke  detectors  allows  us  to  determine  the  exact  location  of  a  fire  given 
only  the  set  of  detectors  that  have  been  triggered.  However,  the  problem  of 
finding  identifying  codes  is  NP-Hard  [  ].  The  computational  cost  has  so  far 
limited  the  real-world  use  of  identifying  codes. 

The  directed  de  Bruijn  graph  B(d,n)  is  a  directed  graph  in  which  the 
vertices  are  strings  of  length  n  from  an  alphabet  A  with  d  letters,  and  with 
a  directed  arc  from  vertex  x\%2 . . .  xn  to  vertex  X2  ■  ■  ■  xna  for  each  a  £  A. 
When  looking  for  a  graph  model  for  applications,  it  is  useful  to  choose  a 
graph  with  relatively  few  edges,  but  many  short  paths  between  any  pair  of 
vertices  [  ].  The  de  Bruijn  graphs  have  both  of  these  desirable  properties. 
In  addition,  given  an  arbitrary  pair  of  vertices  in  a  de  Bruijn  graph,  there 
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Figure  1:  Vertex  Subset  Relationships 

are  routing  algorithms  that,  with  high  probability,  create  a  path  of  length 
O(logn)  between  the  pair  [13].  The  properties  of  de  Bruijn  graphs  enable 
some  problems  that  are  NP-complete  on  general  graphs,  such  as  the  Hamilton 
cycle  problem,  to  be  computationally  solvable  on  de  Bruijn  graphs  [  ].  We 

will  see  that  for  most  directed  de  Bruijn  graphs,  the  construction  of  minimum 
t- identifying  codes  is  indeed  solvable  (when  they  exist). 

Other  vertex  subsets  that  we  consider  for  de  Bruijn  graphs  in  this  pa¬ 
per  are  dominating  sets,  resolving  sets,  and  determining  sets.  Dominating 
sets  provide  complete  coverage  for  a  graph,  while  resolving  sets  provide  an 
identification  of  vertices  in  graphs  using  relative  distances.  Finally,  deter¬ 
mining  sets  provide  a  set  of  vertices  that  is  only  fixed  pointwise  by  the  trivial 
automorphism.  These  types  of  subsets  are  also  useful  in  applications.  For 
example,  resolving  sets  have  been  used  in  aiding  the  navigation  of  robots 
when  distances  to  sufficient  landmarks  are  known  [10],  and  determining  sets 
are  useful  in  graph  distinguishing  which  can  reduce  graph  symmetry  to  en¬ 
hance  recognition.  These  different  vertex  subsets  are  interrelated,  as  shown 
in  Figure  1.  For  example,  each  resolving  set  and  identifying  code  is  also  a 
determining  set,  but  not  vice-versa.  This  is  discussed  more  fully  in  Section 
4.1  and  Section  4.3. 

In  Section  2,  we  give  careful  definitions  necessary  for  working  with  di¬ 
rected  de  Bruijn  graphs.  In  Section  3,  we  prove  that  for  every  t-identikable 
de  Bruijn  graph,  any  t-identifying  code  must  contain  at  least  dn_1(d  — 1)  ver¬ 
tices.  We  prove  by  construction  that  this  bound  is  achievable  for  t-identifying 
codes  when  n  >  2 1.  For  n  =  2t  —  1  we  show  the  existence  of  a  t-identifying 
code  of  size  dn^1(d  —  1)  +  dt,  which  may  or  may  not  be  optimal  size.  Further¬ 
more,  when  n  <  2t  —  2,  we  are  able  to  show  that  B(d,  n)  has  no  t-identifying 
code  due  to  the  existence  of  t-twins.  In  Section  4  we  study  dominating 
sets,  directed  resolving  sets,  and  determining  sets  for  B(d,n).  Section  4.2 
gives  a  proof  that  the  minimum  size  of  a  directed  resolving  set  for  B(d,n )  is 
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dn  1(d  —  1),  and  Section  4.3  that  the  minimum  size  of  a  determining  set  is 


2  Definitions 

We  will  be  considering  various  types  of  vertex  subsets  on  the  class  of  directed 
de  Bruijn  graphs.  The  following  definitions  will  be  useful  in  working  with 
this  class  of  graphs.  We  will  use  the  notation  [x]  =  {1,2,...,  x}. 

Definition  2.1.  Let  Ad  =  {0, 1, . . . ,  d—  1}  and  let  And  be  the  set  of  all  strings 
of  length  n  made  up  of  letters  of  A.  When  d  is  clear  from  context  we  will 
use  A  and  An  respectively. 

— ^ 

Definition  2.2.  The  directed  de  Bruijn  graph,  denoted  B(d,  n),  has 

vertex  set  Ad.  An  edge  from  vertex  £1X2  . . .  xn  to  vertex  y\y2  . . .  yn  exists  if 
and  only  if  x2x3  ...xn  =  yxy2  . . .  yn- 1- 

Definition  2.3.  The  concatenation  of  two  strings  x  =  x±x2 . . .  x*  and 
y  =  2/12/2  ■  ■  ■  Vk  is  given  by  x  ©  y  =  xix2  . . .  Xiyiy2  ■■■yk- 

Definition  2.4.  The  concatenation  of  sets  of  strings  S  and  T  is  given 
by  S'  ©  T  =  {x  ©  1/  |  x  6  5  and  y  G  T}. 

Definition  2.5.  The  prefix  of  a  string  x  =  X\X2...xn  is  the  substring 
XiX2  •  •  •  xn_i,  denoted  by  x~ . 

Definition  2.6.  The  suffix  of  a  string  x  =  X1X2  . . .  xn  is  is  the  substring 
X2X3  . . .  xn,  denoted  by  x+ . 

Definition  2.7.  When  discussing  substrings  of  a  string  XiX2 . . .  xn,  we  will 
use  the  notation  x(a  :  b )  to  denote  the  substring  xaxa+i . . .  xj,. 

Definition  2.8.  If  a  string  x  =  X1X2 . . .  xn  contains  a  constant  substring 
x(a,b )  =  zz . . .  z,  then  we  will  denote  the  consecutive  letters  as  zb~a,  the 
constant  raised  to  the  power  denoting  length.  This  will  also  be  used  for 
repeated  substrings,  such  as  0101 ...  01  =  (01)fc. 


Definition  2.9.  Let  w  —  w\ .. .  wn  G  Ad.  Define 
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Definition  2.10.  Let  w  =  W\  .  .  ,wn  G  A)  and  £  G  Z+  such  that  n  >  2£. 
Then  we  say  that  w  has  period  length  £  if  Wi  —  wl+i  for  all  i  G  [n  —  £].  If  we 
have  n  <  2£,  then  we  say  that  w  has  almost  period  length  £. 

Definition  2.11.  Let  w  G  Ad,  and  suppose  that  w  has  period  length  £,  and 
does  not  have  period  length  k  for  any  k  <  l.  Then  w  is  called  £ -periodic. 

Definition  2.12.  Let  w  G  Ad.  If  there  exists  some  £  >  |  and  word  w'  G 
A(f~n  such  that  w  ©  w'  is  t'-periodic,  then  w  is  called  almost  £-periodic. 

We  now  provide  some  lemmas  regarding  string  properties  that  will  be 
used  later  in  the  paper. 

Lemma  2.13.  [  ]  Let  £i  >  £2  and  w  be  a  word  of  length  n  >  £\  +  £2  — 
gcd(£ i,£2).  If  w  has  periods  (or  almost  periods)  of  length  £\  and  £2  ,  then  w 
has  a  period  of  length  gcd(£i,£2). 

Lemma  2.14.  [  ]  Let  £\  >  £2  and  w  be  a  word  of  length  n  >  £\  +  £2.  If  w 
has  a  period  (or  almost  period)  of  length  £\  and  w^k,rn">  has  a  period  of  length 
£2  for  some  m  G  Ad,  then  there  is  m'  G  Ad  such  that  has  a  period  of 

length  gcd(£i,£2). 

Lemma  2.15.  Let  w  G  Ad  such  that  w  is  £\-periodic  or  almost  £i-periodic. 
Let  m  G  [d  —  1]  and  also  k  G  [n]  with  k  <  n  —  £\  or  k  >  £\.  Then  for 

any  £2  <  |  with  £\  >  £2  and  n  >  £ i  +  £2,  it  is  not  possible  that  is 

£2-periodic. 

Proof.  We  proceed  by  contradiction,  and  suppose  that  w^k'm^  is  ^-periodic. 
We  have  two  cases.  First,  if  k  >  £\,  then  by  Lemma  2.14,  there  exists  some 
ml  G  Ad  such  that  w(k’m  ^  has  period  of  length  gcd(ti,  £2).  Then  we  have  the 
following  chain  of  equalities. 

Wk  =  Wk-e-L  since  w  has  a  period  of  length  £\ 

=  Wk-i2  since  -uhfc’m  >  has  a  period  of  length  gcd(£i,  £2) 

=  w^'m^  since  w^k,rn'>  has  a  period  of  length  £2 

Hence  this  is  a  contradiction.  For  our  second  case,  when  k  <  n  —  £\,  we  note 
the  following. 

(k.m) 

wk  =  wk+e  1  =  Wk+e  2  =  Uik 

This  is  also  a  contradiction.  Therefore  we  must  have  that  u^’™)  is  not  I2- 
periodic.  □ 
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Lemma  2.16.  Let  w  G  Af  such  that  w  has  period  length  A  for  some  £\  < 
For  all  m  G  [d  —  1]  and  for  all  i,j,  k  G  [n]  with  i  <  k  <  j ,  and  for  all  £ 2  <  A 
with  j  —  i  +  1  >  A  +  £2  and  with  either  k  >  i  +  £  1  or  k  <  j  —  £\,  we  must 
have  that  w^k,m\i,j)  does  not  have  period  £2. 

Proof.  Define  w'  =  and  j),  and  then  apply  Lemma 

2.15  to  compare  the  two  strings.  □ 

Lemma  2.17.  Let  n  =  2t  and  let  u  G  Af.  If  u  has  period  length  t  and  for 
some  £  <  t  and  m  G  Ad,  we  find  thatu!  =  u^’m\t+ 1  —  £  :  n  —  1)  is  £-periodic, 
then  we  must  have  that  £  divides  t  and  u '  ©  ( un  +  m )  has  period  £. 

Proof  First,  we  note  that  Um  =  clearly  has  period  length  t, 

and  so  Um(t  +  1  —  £  :  n  —  1)  has  almost  period  length  t.  Additionally,  since 
Um(t  +  1  —  £  :  n  —  1)  =  u1,  we  know  that  Um(t  +  1  —  £  :  n  —  1)  is  Aperiodic. 
Hence  by  Lemma  2.13,  Um(t  + 1—£  :  n  —  1)  has  period  of  length  p  =  gcd(t,f?). 
However  since  u ’  is  given  to  be  Aperiodic,  the  minimum  period  length  is  £ 
and  so  we  must  have  that  p  =  £  and  thus  £  divides  t. 

To  show  that  u'  ®  (un  +  m)  has  period  £,  we  note  that  v!  has  period  £,  and 
that  u ^  =  u[t,m^  —  ut  +  m.  Having  period  £  implies  that  u'k  —  ut  +  m  for  all  k 
that  is  divisible  by  £.  Since  un  +  m  is  the  (t  +  £)th  letter  in  u'  ®  ( un  +  m ),  and 
this  is  divisible  by  £,  we  need  that  un  +  m  —  ut  +  m  in  order  for  v!  ©  [un  +  m) 
to  have  period  £.  But  this  is  given  to  be  true  since  u  has  period  t.  □ 

3  Identifying  Codes 

We  begin  by  building  up  to  the  definition  of  an  identifying  code.  This  requires 
careful  definitions  of  directed  distance  and  Aballs. 

Definition  3.1.  The  directed  distance  from  vertex  u  to  vertex  v  in  a  graph 
G  is  given  by  d(u,v),  and  is  defined  as  the  length  of  the  shortest  directed 
path  from  u  to  v  in  G. 

Definition  3.2.  Let  v  E  V (G).  The  open  in-neighborhood  of  v  is  given  by 
N~(v)  =  {u  G  V(G)  |  (u,v)  G  E(G)},  and  the  closed  in-neighborhood 
is  given  by  W_[n]  =  N~(v)  U  {u}. 

Note  that  out-neighborhoods  are  defined  analogously,  but  will  not  be 
needed  in  this  paper. 
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Figure  2:  A  1-identifying  code  in  the  graph  13(2,3)  (black  vertices).  A  2- 
identifying  code  in  this  graph  requires  all  vertices  but  one  of  {000,  111},  and 
there  are  no  f- identifying  codes  for  t  >  3. 

Definition  3.3.  The  in-ball  of  radius  t  centered  at  vertex  v  is  the  set: 

=  {u  G  V(G)  |  d(u,v )  <  t}.  and  the  out-ball  of  radius  t  centered 
at  vertex  v  is  the  set:  B^~ ( v )  ={u£  V(G)  \  d(v,u )  <  t}. 

The  two  following  lemmas  are  useful  in  working  with  distances  in  B(d,  n ) 
and  their  proofs  are  self-evident. 

Lemma  3.4.  In  B(d,  n )  there  is  a  directed  path  of  length  t  <  n  from  x  to  y 
if  and  only  if  x(t  +  1  :  n)  —  y(  1  :  n  —  t).  That  is,  if  and  only  if  the  rightmost 
n  —  t  letters  of  x  are  the  same  as  the  leftmost  n  —  t  letters  of  y. 

Lemma  3.5.  In  B(d,  n )  if  vertices  x  j -  y  have  the  same  prefix,  then  for  all 
u  7^  {x,y},d(u,x)  =  d(u,y ).  In  particular,  B^(x)  \  {x}  =  B^~(y)  \  {y}  for 
all  t  <  n. 

Definition  3.6.  Given  a  subset  S  C  V(G),  the  S  t-identifying  set  for 

vertex  v  is  given  by  IDs(u)  =  B^(v)  D  S. 

Definition  3.7.  A  t-identifying  code  is  set  S  C  V (G)  such  that  each  vertex 
has  a  unique,  non-empty  f-identifying  set.  That  is,  for  every  u  G  V(G), 
I Dg(w)  7^  0,  and  for  all  pairs  u,v  G  V(G)  we  have  ID s(u)  ^  IDs(u).  The 
variable  t  is  referred  to  as  the  radius  of  the  identifying  code.  See  Figure  2 
for  an  identifying  code  in  the  graph  B( 2,  3). 

In  the  above  definitions,  if  t  is  omitted  from  the  notation  (i.e.  identifying 
code  instead  of  f-identifying  code),  then  it  is  assumed  that  t  —  1. 

Note  that  not  every  graph  has  a  t-identifying  code  for  each  t.  In  particular 
if  the  graph  has  two  vertices  with  equal  in-balls  of  radius  t,  then  the  graph 
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has  no  identifying  code.  The  topic  of  such  ‘f-t  wins’  and  the  resulting  non¬ 
existence  of  f-identifying  codes  is  covered  in  Theorem  3.13. 

Theorem  3.8.  If  B(d,n)  is  a  t -identifiable  graph,  then  the  size  of  any  t- 
identifying  code  is  at  least  dn_1(d  —  1). 

Proof.  Choose  t  <  n  and  a  b  in  A.  Suppose  that  for  some  w  £  AP~l, 
neither  x  =  w  ©  a  nor  y  =  w  ©  b  is  a  set  S.  Since  x  and  y  share  a  prefix, 
by  Lemma  3.5,  Bf(x)  \  {x}  =  Bfi(y)  \  {y}.  Since  neither  x  nor  y  is  in  S, 
IDs(x)  =  ID5'(|/).  Thus  S  is  not  a  f-identifying  code.  Thus  for  each  w  £  An  1 , 
a  f-identifying  code  must  contain,  at  least,  all  but  one  of  w  ©  a  for  a  £  A. 
Thus  a  t- identifying  code  for  B(d,n)  must  have  size  at  least  dn^1{d  —  1).  □ 

Note  that  the  result  above  is  independent  of  the  radius  t.  An  interesting 
consequence  of  this  is  the  fact  that  increasing  the  radius  of  our  identifying 
code  does  not  produce  any  decrease  in  the  size  of  a  minimum  identifying 
code.  For  example,  consider  the  potential  application  of  identifying  codes 
in  sensor  networks.  One  might  think  that  by  increasing  the  sensing  power 
(which  corresponds  to  the  radius  of  the  identifying  code)  we  would  be  able  to 
place  fewer  sensors  and  thus  incur  a  savings  overall.  However,  Theorem  3.8 
implies  that  providing  more  powerful  (and  thus  more  expensive)  sensors  does 
not  allow  us  to  place  fewer  sensors.  Thus  we  should  use  sensors  that  have 
sensing  distance  equivalent  to  radius  one.  In  fact,  in  the  case  of  2-identifying 
codes  in  B( 2,3),  we  actually  require  an  extra  vertex  for  a  minimum  size  of 
seven! 

The  remainder  of  this  section  is  organized  as  follows.  We  first  provide 
a  construction  of  an  optimal  t-identifying  code  for  B(d,n )  with  t  >  2,  and 
n  >  2t  in  Theorems  3.9  and  3.10.  Following  the  proof  of  this  result,  we 
highlight  some  variations  that  provide  identifying  codes  for  several  other 
instances.  Finally,  we  highlight  an  alternative  construction  for  1-identifying 
codes  for  all  B(d,  n )  when  d  2  and  n  is  odd. 

Theorem  3.9.  Suppose  that  n  >  5,  d  >  2,  t  >  2,  and  n  >  2 1.  Then  the 
following  set  S  is  an  optimal  t-identifying  code  of  size  dn~1(d  —  1)  in  B(d,  n ) . 

S  =  {x  £  Arfi  I  for  some  m  and  ^  x^,m\t  +  1  —  I  :  n  —  1)  is  I-periodic, 
but  x^,in\t  +  1  —  I :  n  —  1)  ©  (xn  +  m)  is  not.} 

U 

{x  £  Afi  |  xt  xn  and  x^t,m\t  +  1  —  I  :  n  —  1) 

is  not  I-periodic  for  any  m  and  I  <t} 


Proof.  First,  let  us  note  that  for  all  x  G  A%,  if  +  1  —  £  :  n)  has  period 

l  for  any  m  and  £  <  t,  then  x  ^  S. 

Let  x  be  given,  and  define  Xi  to  be  the  ith  coordinate  of  x.  We  also  define 
the  following  string  xt] . 

{i,  if  k  —  f; 

j,  if  k  =  n; 

x(k),  otherwise. 

In  other  words,  xtJ  is  the  string  Xi . . .  xt-\ixt+i  . . .  xn_ \  j . 

Next,  we  note  that  Xij (t  +  1  —  l :  n  —  1)  =  x ^l~Xt) (t  +  l—£  :  n  —  1) ,  which 
implies  by  Lemma  2.16  that  Xij(t  +  1  —  £  :  n  —  1)  is  ^-periodic  for  at  most 
one  £  <  t  —  |  and  for  at  most  one  i  G  Ad- 

Next,  suppose  that  xab(t  +  1  —  £  :  n)  is  ^-periodic  and  consider  xVj .  We 
note  that  xffa~^  {t  +  l  — £  \  n  —  1)  =  xab(t  +  1  —  £  :  n  —  1),  which  is  ^-periodic. 

Hence  Xi  j  is  a  member  of  S  if  and  only  if  xffa~z\ti  —  £  :  n  —  1)  ©  (j  +  a  —  i) 
is  not  ^-periodic.  In  other  words,  if  and  only  if  i  —  j  ^  a  —  b  mod  d.  On  the 
other  hand,  if  Xi3  (t  +  1  —  £  :  n  —  1)  is  not  ^-periodic  for  any  £  and  i  G  Ad , 
then  x^  G  S  if  and  only  if  i  j -  j.  Hence,  there  is  exactly  one  j  G  Ad  such 
that  xaj  fL  S,  and  similarly  there  is  exactly  one  i  G  Ad  such  that  Xib  S. 
These  pairings  tell  us  that  dn~l  strings  are  not  in  S,  leaving  the  cardinality 
of  S  at  dn  —  dn_1,  or  dn~1(d  —  1). 

Now  that  we  have  established  the  cardinality  of  S',  we  must  show  that  no 
two  nodes  have  the  same  identifying  sets.  Let  x,  y  G  B(d,n),  and  consider 
their  identifying  sets,  called  I(x)  and  I(y),  respectively.  Let  k  be  the  smallest 
index  such  that  Xk  j -  Uk- 


k 


1:  Without  loss  of  generality,  we  may  assume  that  Xi  —  0  and  y\  —  1. 
Observe  that  we  have  the  following  strings  contained  in  the  identifying 
sets. 


x'  = 

04_1  ©  0  ©  x{l  : 

:  n  —  t) 

G 

x"  = 

0i_1  ©  1  ©  x(l  : 

:  n  —  t) 

G 

y'  = 

lt_1®  1©2/(1  : 

:  n  —  t) 

G 

Bf(y),  and 

y"  = 

lt_1©0®j/(l  : 

:  n  —  t) 

G 

Bf(y). 

Note  that  x'  qL  Bf(y)  and  y'  qL  Bf  (x) ,  since  Bf(x)  does  not  contain  any 
vertices  beginning  with  lm  and  Bf(y)  does  not  contain  any  vertices 
beginning  with  0i+1.  Next,  we  notice  that  at  least  one  {x',x"}  is  in  S. 
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To  see  this,  we  note  that  x"  =  By  the  same  point,  we  must  have 

that  at  least  one  of  y y"  is  a  member  of  S. 

Next,  we  note  that  if  at  least  one  of  x',y'  is  a  member  of  S,  we  can 
use  that  string  to  separate  y.  Otherwise,  if  either  x"  B^~(y)  0  S  or 
y"  qL  B 't  (x)  fl  S,  we  can  separate  x  and  y  with  the  given  string.  As  a 
last  resort,  we  consider  the  case  in  which  we  have  x"  G  B^{y)  0  S  and 
y"  G  B^(x)  D  S.  Then  we  must  have: 

x"  =  0t_1  ©  y(l  :  n  —  t  +  1), 

y"  _  jt-i  0  .  n  _  t  ^ 

as  x'J  is  the  only  1  in  x"{l  :  t  +  1)  and  y"  is  the  only  0  in  y"(  1  :  t  +  1). 
From  this,  we  get  the  following  string  equalities: 

x(l:n  —  t)  =  y(2:n  —  t  +  l), 
y(l\  n  —  t)  =  x(2  :  n  —  t  +  1). 

From  these  string  equalities,  we  see  that  ay  =  yi+ 1  =  Xi+ 2  and  ay  = 
Xi+ 1  =  ay+ 2  for  all  i  =  1,  2, ...,  n  —  t—  1,  and  so  x{l  :  n—t)  and  y(  1  :  n—t) 
are  both  2-periodic  (since  ay  7^  we  cannot  have  1-periodic).  Hence, 
—  1  :  n)  =  0  ©  1  ©  x(l  :  n  —  t)  is  also  2-periodic  (recall  that  ay  =  0 
and  iji  =  1),  so  x"  qL  S.  Hence  we  must  have  x'  G  S,  and  so  we  may 
use  x'  to  separate  x  and  y. 

2  <  k  <  n  —  t:  We  know  that  there  must  exist  some  s  such  that  Xi . . .  xs_\  = 
y  1 . . .  ys-i,  and  these  substrings  are  constant.  Without  loss  of  generality 
we  may  assume  that  ay  . . .  ay_i  =  CW1  =  y  1 . . .  ys_  1  and  xs  =  1,  and  so 
2  <  s  <  k.  Define  the  following  strings. 

1/  =  1*  ©  y(  1  :  n  —  t)  and 
y"  =  li_1  ©  0  ©  y(l  :  n  -  t) 

As  we  saw  in  the  previous  case,  we  have  that  {yf,  y"}  C  (y)  and 
{y',y"}  fl  S  ^  0.  Now  consider  an  arbitrary  vertex  v  G  B^~(x).  Since 
xs-ixs  =  01  and  2<s<k<n  —  t,  we  know  that  v(i  —  1  :  i)  —  01  for 
some  i  G  [s,  s  +  t]. 

Additionally,  we  consider  1/  and  i  G  [2,  s+t—  1].  For  i  <  t,  we  know  that 
y\i  —  1  :  i)  —  11,  and  for  i  —  t  +  1,  we  have  that  y\i  —  1  :  i)  —  10,  and 
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finally  for  t  +  2  <  i  <  s  +  t  —  lwe  have  y'(i  —  1  :  i)  =  00.  Similarly,  for 
i  G  [2,  s  + 1  —  1],  we  must  have  y"(i  —  1  :  i)  G  {00, 10, 11}.  This  implies 
that  y'(  1  :  s  +  f  —  1)  and  y"(  1  :  s  +  t  — 1)  do  not  contain  the  substring  01, 
and  so  if  if  or  y"  is  a  member  of  we  must  have  either  d{y\  x)  —  t 

or  d(y",  x)  =  t,  respectively.  Hence  we  must  have  y't+i  =  xt  or  y"+i  =  xt 
for  i  G  [l,n  —  t],  and  therefore  that  xk  =  y[+k  =  y"+k  =  Uk,  which  is  a 
contradiction.  Thus  neither  y'  nor  y"  can  be  a  member  of  Bf(x),  and 
hence  both  strings  separate  x  and  y. 

k  >  n  —  t:  Since  we  must  have  X\  =  y\ ,  we  may  assume  without  loss  of 
generality  that  X\  =  y±  ^  0.  Dehne  the  following  strings. 

u  =  0n_fc  ©  x(l  :  k)  and 

v  =  0n~k  ©  y(  1  :  k) 

Clearly  we  have  u  G  B^{x)  and  v  G  B^{y).  Additionally,  we  have 
u(  1  :  n  —  1)  =  u(l  :  n  —  1)  and  un  =  xk  ^  yk  =  vn-  Note  that  this 
implies  that  for  a  =  xt-n+k  we  have  u  =  ua^Xk  and  v  =  ua^yk  with 
Xk  7^  yk-  By  our  argument  at  the  very  beginning  of  the  proof,  at  most 
one  of  these  can  lie  outside  of  S,  and  so  we  must  have  {u,  v}  D  S  ^  0. 

We  now  have  two  cases.  First,  if  both  u  qL  B^{y)  and  v  B^~(x), 
then  any  string  from  {u,  v)  D  S  separates  x  and  y ,  and  we  are  done. 
Otherwise,  assume  without  loss  of  generality  that  u  G  B^{y).  Then  we 
must  have  u  —  w  ©  y(l  :  p)  for  at  least  one  p  G  [n  —  t,n\.  Take  p  to 
be  the  largest  such  p  possible.  Since  y\  ^  0  =  Ui  for  all  i  G  [1  ,n  —  k\, 
we  must  have  p  <  k.  Additionally,  if  p  —  k  then  we  have  y(  1  :  k)  = 
y(  1  :  p)  =  x{l  :  k),  which  is  a  contradiction  since  yk  ^  xk-  This  implies 
that  we  must  have  p  <  k.  Hence  we  must  have  the  following  string  of 
equalities: 


0n~k  ©  x{l  :  k) 


u 

w  ©  y(l  :  p) 
w  ©  x(l  :  p). 


Thus  we  have  x(k  —  p  +  1  :  k)  =  x(l  :  p),  or  ay  =  xk-p+i  for  i  = 
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1,2, ...  ,p.  Additionally  we  note  the  following  equalities  hold: 


to 

?r 

1 

7A 

2k  —  2  p 

< 

2k  —  2  (n  — 

< 

k  +  2t  —  n 

< 

k. 

Hence  since  X;  =  Xk~p+i  for  i  G  [1  ,p\  and  2 (k  —  p)  <  k,  we  know  that 
x(l  :  k)  has  period  (k  —  p).  In  fact,  since  we  chose  p  to  be  maximum, 
x(l  :  k )  is  ( k  —  p)-periodic. 

Next,  we  show  that  u(t  +  1  —  (k  —  p)  :  n)  is  also  ( k  —  p)-periodic.  First, 
we  note  the  following  inequalities: 

n  —  (t  +  1  —  (k  —  p))  +  1  =  n  —  t  +  k  —  p 

>  2(k  —  p). 

The  last  line  comes  from  the  facts:  n  —  t>t>  n—p  >  k  —  p.  Hence  our 
string  length  is  at  least  2(k  —  p),  and  from  our  previous  paragraphs,  so 
long  as  u(t  +  1  —  (k  —  p)  :  n)  is  contained  in  u(n  —  k+1  :  n)  —  x(l  :  k), 
we  know  that  it  must  have  period  (k  —  p).  For  this  we  note  that 

t  +  1  —  (k  —  p)  >  (n  —  p)  +  1  —  (k  —  p)  —  n  —  k  +  1, 

and  so  u(t  +  1  —  (k  —  p)  :  n)  indeed  has  period  (k—p),  and  thus  u  S, 
except  if  (n  —  1)  —  (t  +  1  —  (k  —  p))  +  1  <  2 (k  —  p).  In  this  case  we  must 

have  k  —  p  =  t,  k  =  n  =  2t,  and  p  =  t. 

If  u^t,rn\t  +  l—l  :  n  —  1 )  is  Aperiodic  for  some  i  <  k  —  p  =  t  and  m,  then 
by  Lemma  2.17  we  must  have  that  u^t,m\t+l— l :  n— l)(B(un+m)  is  also 
Aperiodic,  and  hence  u  ^  S.  On  the  other  hand,  if  '«A"d(f+l— £  ;  77,-1) 
is  not  Aperiodic  for  any  £  <  t  and  m,  then  we  note  that  un  =  u2t  =  ut, 
so  again  u  S.  Therefore  in  all  cases  we  have  u  ^  S,  so  we  must  have 
v  G  S. 

All  that  remains  is  to  show  that  v  ^  B)~(x).  We  note  that  if  v  G  B)~(x), 
then  y(  1  :  k)  is  Aperiodic  for  some  £  <  k  +  t  —  n  (using  the  same 
argument  as  we  used  to  show  that  x(l  :  k)  was  (k  —  p)-periodic) .  Since 
x(l  :  k)  =  (r/(l  :  k))('k'r"!  for  some  m,  by  Lemma  2.15  it  is  not  possible 

that  both  x(l  :  k)  and  y(l  :  k)  are  periodic.  Hence  we  must  have 

v  ^  B)~(x),  and  so  we  may  use  v  to  separate  x  and  y. 
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□ 

When  we  combine  the  previous  theorem  with  the  following  result,  we  have 
a  complete  set  of  constructions  for  optimal  t- identifying  codes  in  B(d,  n)  with 
t  >  2  and  n  >  2 1. 

Theorem  3.10.  Let  n  >  3  and  S  =  A]  \  {aqaaqaq . . .  xn-\a  \  a  G  Ad}- 
If  n  is  even,  then  S  is  a  2-identifying  code  for  B(d,n).  If  n  is  odd,  then 
S'  =  (S'  U  |  a  jti  h  E  A2})  \  {(ah)1^ a  |  a  /  6  6  A2}  is  a  2- 

identifying  code  for  B(d,n).  In  both  these  cases,  the  2-identifying  code  is  of 
optimal  size  dn~1(d  —  1). 

Proof.  Consider  an  arbitrary  string  x  =  x  1X2X3  . . .  xn  G  Alf,  and  define  the 
set  T  =  IDs(a:).  We’ll  consider  the  contents  of  T  in  four  cases  based  on  the 
equality  of  aq,a:n_i  and  of  X2,xn.  First  let  C  =  {ax  |  a  G  *4.\  {xn-2}}- 

Case  1.  If  X2  =  xn  and  aq  =  xn-\,  then  T  =  A@C.  Thus  \T\  =  d2  —  d. 

Case  2.  If  X2  ^  xn  and  x\  =  xn-i,  then  T  =  (^4  ©  C)  U  {x}.  If 
x  G  A  ©  C  then  x+  =  ax  for  some  a  G  A  \  {xn-2}-  In  this  case, we  have 
2:20:3  •  •  •  xn  =  ax ix2  ■  ■  ■  xn_2-  This  implies  that  we  have  x±  =  x3  =  x5  =  ■  ■  ■ , 
and  also  that  x2  =  x4  =  x6  =  ■  ■  ■ .  Since  this  case  requires  that  X\  =  xn_i, 
we  must  have  that  either  n  is  even  or  that  x\  —  X2  —  x3  —  ■  ■  ■  —  xn.  In 
either  case,  this  contradicts  our  assumption  that  X2  ^  xn.  Thus  x  ^  A®  C , 
and  we  conclude  that  \T\  =  d2  —  d  +  1. 

Case  3.  If  x2  =  xn  and  X\  a:n_i,  then  T  =  A  ©  { C  U  {a:-}}.  If 
x~  =  ax  for  some  a  G  *4.\{a:n_2},  then  ax\X2  ■  •  ■  xn_2  =  X\X2  ■  ■  ■  xn_i-  This 
implies  that  we  have  X\  —  x2  —  x3  —  ■  ■  ■  —  xn_2  =  xn-i-  This  contradicts 
our  assumption  that  X\  xn-\.  Thus  x~  ax  for  any  a  G  A  \  {a:n_2}, 
and  we  conclude  that  \T\  =  df. 

Case  4.  If  X2  ^  xn  and  X\  xn-\,  then  T  =  (A®  {C  U  {a:-}})  U  {2:}. 
As  in  Case  3,  since  x\  xn-\,A®  {C  U  {a:-}}  contains  d2  distinct  elements. 
Let  us  consider  whether  x  G  A  ©  {C  U  {a:-}}.  If  not,  then  \T\  =  d2  +  1. 
There  are  two  cases  to  consider. 

a.  If  x  G  A  ©  C,  then  x+  =  x~ .  In  this  case,  we  must  have  that 
x2x3X4  ■  ■  ■  xn  —  X1X2  ■  ■  ■  a:n_i,  which  implies  that  we  have  the  following  chain 
of  equalities:  aq  =  x2  =  x3  =  ■  ■  ■  =  a:n_i  =  xn.  This  contradicts  the 
assumptions  that  x2  ^  xn  and  a:i  A  xn- 1-  Thus,  this  case  does  not  occur. 
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b.  If  x  £  A  ©  {a?-},  then  x+  =  ax  for  some  a  £  A.  Then 
x2x3  ■  ■  ■  xn  =  ax  1X2  ■  ■  ■  xn_2-  This  implies  that  x\  —  x3  —  x5  —  ■  ■  ■ ,  and 
also  that  X2  =  X4  =  x§  =  •  •  • .  If  n  is  even,  this  contradicts  onr  assumptions 
that  x2  7^  xn  and  X\  7^  xn-\.  Thus  for  even  n,  this  case  does  not  occur.  For 
n  odd,  this  case  only  occurs  if  x  £  {(ab)^~ a}. 

^ _ 

Thus,  if  n  is  even,  or  n  is  odd  and  x  {(ab)^~ a},  we  can  see  that 
T  =  IDs(ar)  completely  determines  the  string  x.  In  particular,  given  T  we  can 
decide  which  case  we  are  in  based  on  \T\.  We  can  then  determine  x^, ...  ,xn 
based  on  the  content  of  T.  Thus  in  these  cases  S  is  an  identifying  code. 

77.  —  1  ~ 

However,  if  n  is  odd,  and  x  £  {(ab)^~ a}  we  must  change  S  to  get  an 
identifying  code.  Note  that  Bf((ab)^~ a)U{(ab)^~b}  =  Bf((ab)^~b).  Since 
our  set  S  contains  vertices  of  the  form  (ab)^~ a  but  not  (ab)^~b,  these  two 
types  of  vertices  must  have  identical  identifying  sets  with  respect  to  S.  Thus 
by  adding  the  vertices  in  {(ab)^~b},  we  are  able  to  create  distinct  identifying 
sets  with  respect  to  S  U  {{ab)^~b}.  However,  we  note  that  we  now  have 
the  vertices  of  {{afy^b}  and  {(ab),JA~  a}  in  our  identifying  code,  but  that 
— a )  U  {b(ba)~ —  }  =  Bf(b(ba)~ ~).  This  implies  that  the  inclusion 
of  both  (ab)  J~2-b  and  (afo)12^- a  in  our  identifying  code  is  only  necessary  if  they 
are  required  to  identify  vertex  ( ab )R2~ a  from  vertex  b{ba)~ ~ .  So,  as  long  as 
we  can  identify  ( ab )Ii2_ a  differently  from  b{ba)~ ~  without  using  b(ba)~ a-,  we 
need  only  include  (ab)^~b  and  not  (ab)^~ a  in  our  identifying  code.  Since 
these  two  vertices  have  disjoint  in-balls  of  radius  2  for  n  >  3,  they  must  have 
distinct  2-identifying  sets.  Thus  S'  is  a  2-identifying  code  in  this  case.  □ 

Finally,  we  provide  additional  constructions  of  identifying  codes.  Theo¬ 
rems  3.11  and  3.12  have  proofs  very  similar  to  that  of  Theorem  3.9,  so  we 
omit  them  here. 

Theorem  3.11.  Assume  that  d  >  2,  and  n  >  3.  Then  the  following  subset 
S  is  an  optimal  1-identifying  code  of  size  dn~l(d  —  1)  in  B(d,n). 

S  =  {x  £  Af  |  for  some  m  and  i  £  {1,  2},  x^l,rn\l  :  n  —  1)  is  t- periodic  or 
almost  i-periodic,  but  x^1,m\l  :  n  —  1)  ©  (xn  +  m)  is  not.} 
U 

{x  £  Af  \  Xi  jtz  xn  and  x^1,m\l  :  n  —  1) 

is  not  i- periodic  for  any  m  and  i  £  {1,  2}} 
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Theorem  3.12.  Assume  that  d  >  2.  Then  the  following  subset  S  is  a  t- 
identifying  code  of  size  dn^l(d—l)+dt  in  the  directed  de  Bruijn  graph  B(d,  n), 
ifn  =  2t-l>  5. 

S  ={x  G  Aj  |  for  some  m  and  l  <  t  —  1:  x^t,rn\t  +  1  —  l  :  n  —  1)  is  /-periodic, 
but  x^’m\t  +  1  —  l  :  n  —  1)  ©  ( xn  +  m)  is  not.} 

U 

{ x  G  Ad  |  xt  7^  xn  and  x^’m\t  +  1  —  l  :  n  —  1)  is  not  /-periodic 
for  any  m  and  /  <  t  —  1.} 

U 

{x  G  Ad  |  x(<’m)(l  :  n  —  1)  is  almost  /-periodic,  for  some  m.} 

We  note  that  the  construction  in  Theorem  3.12  is  not  optimal.  To  find 
an  optimal  /-identifying  code  when  n  —  2/  —  1  is  an  open  problem  to  be 
considered  in  the  future.  For  the  cases  when  n  <  2/  —  1,  we  have  the  following 
theorem. 

Theorem  3.13.  There  is  no  t-identifying  code  in  the  directed  de  Bruijn  graph 
B(d,  n )  when  n  <  2/  —  2. 

Proof.  Let  u  =  0n_t  ©  1  ©  (L-2  ©  1  and  v  =  0n_t  ©  1  ©  0t_2  ©  0.  Since  Bf{u) 
and  Bf(y)  contain  all  vertices  that  end  with  0n_t  or  0n_t  ©  1  ©  0fc  where 
k  —  0, 1, . . . ,  n  —  t  —  1,  u  and  v  are  /-twins.  Thus  (d,  n)  has  no  /-identifying 
code.  □ 

As  an  additional  treat  for  the  reader,  we  provide  a  simple  construction 
for  1-identifying  codes  in  B(d,n )  whenever  we  have  either  d  >  2  or  n  odd. 

Theorem  3.14.  If  n  is  odd ,  or  n  is  even  and  d  >  2,  then 

s  =  Ad  \  {a  ©  Ad~2  ©  a  |  a  G  Ad } 

is  an  identifying  code  for  B(d,n).  Further  this  identifying  code  has  optimal 
size  ( d  —  l)dn~l . 

Proof.  Define  S  as  in  the  statement  of  the  theorem.  First,  we  will  see  that  the 
identifying  set  for  every  vertex  has  size  either  d  or  d  —  1.  Let  x  =  X1X2  ■  ■  ■  xn, 
then 

N~(x)  fl  S  =  {A®  x1x2  ■  ■  .xn-i)  \  {xn_ixix2  •  •  -xn_i}. 
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If  X\  —  xn,  then  IDs(x)  —  N  (x)  D  S  has  size  d  —  1.  Whereas,  if  xq  ^  xn, 
then  IDs(x)  =  {x}  U  N~(x )  D  S  has  size  d. 

From  this  it  is  clear  that  every  vertex  has  a  non-empty  identifying  set. 
However  we  must  also  show  that  every  identifying  set  is  unique.  Suppose 
there  are  two  distinct  vertices  x,  y  £  1 '/(B(d,n))  such  that  IDg(x)  =  IDs(y). 
Call  their  identical  identifying  set  T.  We  look  at  the  two  cases,  \T\  =  d  and 
\T\  =  d  —  1,  separately  below. 

Suppose  that  |T|  =  d.  Then  {x,y}  C  T  by  our  assumption  on  T  and  our 
earlier  reasoning.  Since  x  ^  y,  this  means  that  B(d,  n )  contains  both  directed 
arcs  x  — >  y  and  y  — »  x.  This  allows  us  to  conclude  that  {x,  y}  =  {( ab)k ,  ( ba)k } 
for  some  distinct  a,b  G  A  with  k  =  n/2.  In  particular  we  must  have  n  even. 
Below  are  the  precise  identifying  sets  for  x  and  y. 

IDS'((ah)fc)  =  {(ab)k,  ( ba)k }  U  {c(a6)fc_1a  |  c  e  A  \  {a,  6}} 

ID s{{ba)k)  =  {( ab)k ,  (6a)fc}  U  {c{ba)k^b  \  c  e  A  \  {a,  b}} 

If  d  >  2  these  two  identifying  sets  are  in  fact  different,  which  is  a  contra¬ 
diction. 

Suppose  that  \T\  =  d  —  1.  Then  neither  x  nor  y  is  in  T,  which  means 
neither  is  in  S.  However  since  their  identifying  sets  are  identical,  this  means 
that  they  have  identical  first  neighborhoods.  By  definition  of  first  neigh¬ 
borhoods,  this  means  that  x  and  y  have  the  same  prefix  but  different  final 
letters.  By  then  definition  of  S,  one  of  x,  y  (if  not  both)  is  a  member  of  S, 
which  is  a  contradiction.  □ 

4  Dominating,  Resolving,  and  Determining 
Sets 

In  this  section  we  examine  other  types  of  vertex  sets  which  identify  or  classify 
vertices  up  to  some  graph  property.  The  properties  used  to  define  these  sets 
are  adjacency,  distance,  and  automorphisms. 

4.1  Dominating  Sets 

Definition  4.1.  A  (directed)  t-dominating  set  is  a  subset  S  C  V(G)  such 
that  for  all  v  G  V(G)  we  have  B^{y)  fl  S  ^  0.  That  is,  S'  is  a  (directed) 
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^-dominating  set  if  every  vertex  in  G  is  within  (directed)  distance  t  of  some 
vertex  in  S.  We  denote  the  size  of  a  minimum  ^-dominating  set  in  a  graph 
Gbylt{G). 

Note  that  by  definition  every  identifying  code  is  also  a  dominating  set, 
but  not  conversely. 

Theorem  4.2.  [  ]  For  d  >  2,  n  >  1,  71  (B(d,n))  =  \-j-j]  ■ 

In  [11]  a  construction  of  a  minimum  dominating  set  for  B(d,n)  is  given. 
Key  to  this  construction  is  the  fact  that  every  integer  m  corresponds  to  a 
string  (base  d)  in  that  we  call  Xm.  The  construction  utilizes  a  special 
integer  m  defined  by: 


f  dn  2  +  dn  4  + - b  dn  2k  + - b  d2  +  1  mod  dn,  if  n  is  even; 

m  \  dn~2  +  dn~4  + - b  dn~2k  + - b  d3  +  d  mod  dn,  if  n  is  odd. 

Let  D  =  {m,m  +  1, . . .  ,m  +  Tipil  —  1}-  Now  let  S  be  the  set  of  strings 

{Xi  |  %  G  D}.  Then  S  is  a  minimum  size  dominating  set  for  B(d,n). 

Next  we  provide  constructions  for  t-dominating  sets.  While  others  have 
considered  some  variations  of  t-dominating  sets  (such  as  perfect  dominating 
sets  in  [12]),  it  does  not  appear  that  the  general  t-dominating  sets  have  been 
considered  in  the  directed  de  Bruijn  graph. 

Theorem  4.3.  The  set  S  U  {0n}  where 


S  =  {x  e  And  |  Xk(t+ 1)  7^  0  for  some  k  G  Z+  and  x^  =  0  for  all  i  <  k(t  +  1)} 


is  a  t-dominating  set  of  size 


1  +  dn~t-\d-l) 


1  -  <r(t+1)  L^tJ  \ 
1  -  d-h+b  ) 


in  B(d,  n). 

Proof.  Let  1  be  a  vertex  in  Ad.  Assume  that  there  are  k  zeros  at  the  begin¬ 
ning  of  x,  but  not  k  +  1  zeros,  i.e.  x  =  0n  or  x  =  0fc  ©  a  ©  x(k  +  2  :  n)  for 
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some  a  ^  0.  Let  l  G  [0,  t]  be  an  integer  so  that  k  +  l  =  t  (mod  t  +  1),  i.e. 
k  +  l  —  t  +  m(t  +  1)  =  (m  +  l)(f  +  1)  —  1  for  some  m  >  0.  Now 

O'  ©  x(l  :  n  -  l)  =  O'  ©  0k  ©  a  ©  x(k  +  2  :  n  -  l)  =  0k+l  ©  a  ©  x(k  +  2  :  n  -  l) 

belongs  to  S  except  if  k  + 1  >  n.  If  k  +  l  >  n,  then  0n~k  ©  x{l  :  k )  =  0”  G 
S  U  {O’1}  dominates  x.  Therefore  every  vertex  is  dominated  by  S  U  {0n}. 

There  are  ^r!_fc(t+1)  •  (d  —  1)  vertices  which  begin  with  exactly  k(t  + 1)  —  1 
zeros.  Moreover,  every  vertex  of  S  \  {0n}  begins  at  most  n  —  1  zeros.  This 
needs  that  k(t  +  1)  <  n  or  1  <  k  <  .  Finally,  0n  is  added  to  the 

dominating  set  S  U  {0n}.  Therefore  the  size  of  S  U  {0n}  is 


LtttJ 

1  +  dn~i{t+l)  ■  (d  ~  !) 


LittJ 

=  1  +  dn(d  -  1) 


=  l  +  dn(d-l) 

=  l  +  dn(d-l) 
=  l  +  dn(d-l) 
=  l  +  dn~t-\d- 


(  LtttJ  \ 

-!+  E 

\  1=0 

i  -  (d-*-i)LtfrJ+1 
1+  l  -  d-h+b 


'rf-o+i)  _  (rf-o+i))  L‘TiJ+1 


l  -  d-o+b 


1  -  d-(*+DLi¥rJ  \ 

1  -  d  )  ' 


□ 

This  result  gives  us  the  following  lower  bound  on  the  size  of  a  ©dominating 
set  in  B(d,  n ). 

Theorem  4.4.  Bounds  on  the  size  of  a  t-dominating  set  in  B(d,  n )  are  given 
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by: 


7 t(B(d,n))  > 


l  +  d^-'id-  1) 
d1'-'  '(f/-  1) 


i-rf-(t+1)  L^ftJ 

1— d-(t+1) 

i-rf-(t+1)  L^ftJ 

1— d_(t+1) 


if  n  =  t  (mod  t  +  1) 


otherwise. 


Proof.  Suppose  that  the  set  S'  is  a  f-dominating  set  in  B(d,n).  Choose  a  G 
AA{0}  and  i  E  Z  so  that  i  <  and  w  G  ^  Let  x  =  u;©a©0*d+1)_1. 

We  note  that  Bf(x)  contains  the  following  elements. 

Bf{x)  =  [y\y  =  w'(Bw(Ba®  for  some  k  G  [0,  t],w'  G  Ad} 

Note  that  for  all  v  ^  x  such  that  v  =  v'  ©  b  ©  (©d+L-1  with  6  G  [d  — 
1],  j  <  ^-j-,  and  v'  G  we  must  have  that  Bf(x)  D  Bf(v)  =  0. 

Hence  each  of  these  types  of  strings  must  dominated  by  a  different  element 
of  S',  and  so  we  must  have  the  following  lower  bound  on  |S"|.  Define  A  = 
|u  |  v  =  v1  ©  b ©  (F'd+d-i  with  b  g  [d  —  l],  j  <  ^,v'  g  A'd~j(t+1)}- 

1 5"  |  > 


\A\ 

Ltfi-I 

J2  dn-j(t+i)  •  (d  - 1) 

i=i 


d 


»n— t— 1 


(d-i) 


l  -  d-d+i) 


Finally,  we  consider  the  string  0n  and  note  that 

Bf( 0”)  =  {z\z  =  z'®  0n_s  with  z'  eAsd,s<t}. 

When  we  compare  Bf(0n)  with  Bf{x),  we  note  that  since  o  ^  0  we  must 
have  that  the  closest  element  of  Bf[x)  to  0™  is  x  itself.  Next  we  note  that 
the  string  closest  to  0n  in  the  set  A  will  occur  when  j  =  LyfiJ-  This  will 
give  us  the  string  with  the  most  0’s  packed  at  the  right  end.  Finally,  if 
n  =  p  mod  t  +  1,  then  this  string  looks  like  v'  ©  b  ©  0n_p_1  with  v'  G  Apd  and 
b  0.  If  p  —  t,  then  we  are  still  unable  to  reach  0",  and  so  we  must  have  at 
least  one  additional  string  in  S'  to  cover  0™.  □ 
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Figure  3:  A  directed  resolving  set  in  the  graph  23(2,3)  (black  vertices). 

4.2  Resolving  Sets 

Definition  4.5.  A  directed  resolving  set  is  a  set  S  such  that  for  all 
u,v  G  V(G )  there  exist  s  G  S'  so  that  d(s,u )  7^  d(s,v).  The  directed 
metric  dimension  is  the  minimum  size  of  a  directed  resolving  set.  An 
example  of  a  directed  resolving  set  in  23(2,  3)  is  given  in  Figure  3. 

Note  that  this  definition  is  not  quite  the  same  as  that  given  in  [6]  (which 
requires  that  there  exist  s  G  S'  so  that  d(u,s )  7^  d(v,s)).  Our  definition 
corresponds  better  to  the  definitions  of  domination  and  of  identifying  codes 
for  directed  graphs  that  are  used  in  this  paper. 

Theorem  4.6.  The  directed  metric  dimension  for  B(d,n )  is  dn_1((i  —  1). 

Proof.  The  following  shows  that  for  each  w  G  A n_1  a  directed  resolving  set 
for  B(d,n )  must  contain  (at  least)  all  but  one  of  the  vertices  with  prefix  w. 
Suppose  that  w  G  An~l,  and  i  7^  j  G  A  so  that  neither  of  w  ©  i,  w  ©  j  is 
in  our  set  S'.  Note  that  if  x,  y  G  V(B(d,n)),  with  x  j -  y,  then  the  distance 
from  x  to  y  is  completely  determined  by  x~  (and  y+ ).  Since  neither  w  ©  i 
nor  w®j  is  in  S',  and  both  have  the  same  prefix,  d(w(Bi,x )  =  d(w(Bj,  x)  for 
all  x  G  S.  Thus  S  is  not  a  directed  resolving  set.  Thus  for  every  w  G  A n_1, 
S  must  contain  (at  least)  all  but  one  of  the  strings  w  ©  j  for  j  G  A.  Thus 
| S'!  >  dn~l(d  —  1).  Since  {w  ©  0  |  w  G  An~1}  can  easily  be  shown  to  be  a 
directed  resolving  set,  we  have  the  desired  equality.  □ 

The  combination  of  Theorem  3.8  and  Theorem  4.2  yields: 

Corollary  4.7.  The  directed  metric  dimension  for  B(d,n)  is  equal  to  the 
minimum  size  of  a  t-identifying  code  for  B(d,  n )  if  2 1  <  n. 
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Figure  4:  A  minimum  size  determining  set  for  B( 2,3)  (black  vertex). 

4.3  Determining  Sets 

In  this  section  we  will  use  a  determining  set  to  help  us  illustrate  the  auto¬ 
morphism  group  of  B(d,  2),  study  the  relationship  between  Aut(£>(d,  n  —  1)) 
and  Aut(£?(<i,  n ))  and  use  the  result  to  find  the  determining  number  for  each 
B(d,n).  First  let’s  recall  some  definitions. 

Definition  4.8.  An  automorphism  of  a  graph  G  is  a  permutation  n  of 
the  vertex  set  such  that  for  all  pairs  of  vertices  u,v  G  14(G),  uv  is  an  edge 
between  u  and  v  if  and  only  if  tt(u)tt(v)  is  an  edge  between  tt(u)  and 
An  automorphism  of  a  directed  graph  G  is  a  permutation  7 r  of  the  vertex 
set  such  that  for  all  pairs  of  vertices  u,v  G  14(G),  uv  is  an  edge  from  u  to 
v  if  and  only  if  7i(u)n(v)  is  an  edge  from  7 r(w)  to  7r(7>).  One  automorphism 
in  the  binary  (directed  or  undirected)  de  Bruijn  graph  is  a  map  that  sends 
each  string  to  its  complement. 

Definition  4.9.  [  ]  A  determining  set  for  G  is  a  set  S  of  vertices  of  G 
with  the  property  that  the  only  automorphism  that  fixes  S  pointwise  is  the 
trivial  automorphism.  The  determining  number  of  G,  denoted  Det(G)  is 
the  minimum  size  of  a  determining  set  for  G.  See  Figure  4  for  an  example. 

Note  that  an  alternate  definition  for  a  determining  set  is  a  set  S  with  the 
property  that  whenever  /,  g  G  Aut(G)  so  that  f(s )  =  g(s)  for  all  s  G  S,  then 
f{v )  =  g{y)  for  all  v  G  14(G).  That  is,  every  automorphism  is  completely 
determined  by  its  action  on  a  determining  set. 

Notice  that  since  for  both  directed  resolving  sets  and  for  identifying  codes, 
since  each  vertex  in  a  graph  is  uniquely  identified  by  its  relationship  to  the 
subset  by  properties  preserved  by  automorphisms,  the  subset  it  also  a  de¬ 
termining  set.  Thus  every  directed  resolving  set  and  every  identifying  code 
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is  a  determining  set.  However,  though  domination  is  preserved  by  automor¬ 
phisms,  vertices  are  not  necessarily  uniquely  identifiable  by  their  relationship 
to  a  dominating  set.  Thus  a  dominating  set  is  not  necessarily  a  determin¬ 
ing  set.  However,  the  relationships  above  mean  that  the  size  of  a  minimum 
determining  set  must  be  at  most  the  size  of  a  minimum  identifying  code  or 
the  directed  metric  dimension.  For  de  Bruijn  graphs  we  have  shown  that  the 
latter  numbers  are  rather  large.  Does  this  mean  that  the  determining  num¬ 
ber  is  also  large.  We  will  see  in  Corollary  4.13  that  the  answer  for  directed 
de  Bruijn  graphs  is  a  resounding  ‘No’. 

Lemma  4.10.  S  =  {00, 11,  22,  33, . . . ,  (d  —  l)(d  —  1)}  is  a  determining  set 
for  B(d,  2). 

Proof.  Suppose  that  a  G  Aut(£>(d,  2))  fixes  S  pointwise.  That  is,  a (ii)  =  ii 
for  all  i  G  A.  Choose  if  ^  rs  G  V(B(d,  2)).  Then  either  i  ^  r  or  j  /  s  (or 
both).  If  i  jtz  r  then  d(ii,  rs )  =  2  which  is  distinct  from  d(ii,  ij )  =  1.  Since  an 
automorphism  of  a  directed  graph  must  preserve  directed  distance,  u(ij)  ^  rs 
if  i  ^  r.  If  j  ^  s,  then  d(rs,jj )  =  2  which  is  distinct  from  d(ij,jj )  =  1. 
Thus,  again  using  that  a  preserves  directed  distance,  a{ij )  ^  rs  if  j  ^  s. 
Thus,  a{ij )  =  ij  for  all  ij  G  V(B(d}  2))  and  therefore  a  is  the  identity  map 
and  S'  is  a  determining  set.  □ 

Note  that  we  are  using  directed  distances  both  from  and  to  elements  of 
the  set  S.  Thus  S  does  not  fit  the  definition  of  a  directed  resolving  set  for 
B(d ,  2)  (by  [6],  this  would  require  that  each  vertex  v  G  V(G)  be  distinguished 
by  it  directed  distance  to  the  vertices  of  the  resolving  set).  However  directed 
distances  both  to  and  from  a  set  can  be  used  in  determining  automorphisms 
of  a  directed  graph. 

Lemma  4.11.  Aut(H(d,  2))  Sym(Sl(i). 

Proof.  Let  a  G  Sym(Sld).  Define  ipa  on  V(B(d,n ))  by  applying  a  to  each 
vertex  coordinate- wise.  That  is  </?CT(a6)  =  a(a)a(b).  It  is  easy  to  show  that 
ipa  preserves  directed  edges  and  thus  is  an  automorphism.  Further,  distinct 
permutations  in  Sym(^4rf)  produce  distinct  automorphisms  since  they  act 
differently  on  the  vertices  of  the  determining  set  S  defined  above.  Thus  we 
have  an  injection  Sym(^4d)  Aut(£>(d,  2)). 

Since  the  vertices  of  S  are  precisely  the  vertices  with  loops,  every  auto¬ 
morphism  of  B(d,2),  must  preserve  S  setwise.  This  provides  the  necessary 
injection  from  Aut (B(d,  2))  Sym(^4).  Thus,  Aut (B(d,  2))  =  Sym(^4d).  □ 
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Note  that  we  can  consider  the  automorphisms  of  B(d,  2)  as  permutations 
of  the  loops,  but  we  can  simultaneously  consider  them  as  permutations  of 
the  symbols  in  the  alphabet  Ad-  It  can  be  useful  to  view  the  automorphisms 
in  these  two  different  ways. 

Note  that  as  shown  in  [1],  B(d ,  n)  can  be  built  inductively  from  B(d,  n  —  1) 
in  the  following  way.  The  vertex  X\  . . .  xn  £  V(B(d,n ))  corresponds  to  the 
directed  edge  from  xi . . .  xn-\  to  x2. . .xn  in  B(d,n  —  1).  The  directed  edge 
B(d,n)  from  x\...xn  — >  X2--.xnxn+i  corresponds  to  the  directed  2-path 
x\  . . .  xn-\  — >  x2  ■  ■ -  xn  — *  X3  . . .  xn+i  in  B(d,  n  —  1).  That  is,  B(d,  n )  is  the 
directed  line  graph  of  the  directed  graph  B(d,n  —  1).  Thus,  by  [8]  (Chapter 
27,  Section  1.1),  Aut(B(d,  n—  1))  =  Aut  (B(d,n))  =  Sym(^4).  In  the  following 
paragraphs  we  see  detail  this  correspondence. 

Suppose  that  tp  £  Aut (B(d,n)).  Since  tp  preserves  directed  edges,  we 
know  that  both  tp[x i  .  . .  xn )  =  a\  . . .  an  and  <p(x 2  . . .  xn+i)  =  b\  . . .  bn  if  and 
only  if  a2  —  bi, . . . ,  an  —  6n-i-  Thus  if  ip(x i  . . .  xn-ixn )  =  ai  .  . .  an-ian  then 
for  every  b  £  A,  p>(x\  . . .  xn_i z)  =  .  .  .  an_iC  for  some  c  £  A.  In  particular, 
this  allow  us  to  define  an  automorphism  <p'  £  Aut (B(d,n  —  1))  correspond¬ 
ing  to  ip  £  Aut  (B(d,7i)).  Define  tp'  by  tp'(x  i...xn_i)  =  ai . . .  an_i  where 
ip(x i . . .  xn )  =  ai  .  . .  an- 1-  By  the  preceding  discussion,  ip'  is  well-dehned.  It 
is  also  clearly  a  bijection  on  vertices  of  B(d,n  —  1).  Consider  x\  . .  .xn_i 
and  x2 . . .  xn-ixn,  the  initial  and  terminal  vertices  of  a  directed  edge  in 
B(d,  7i  —  l).  Since  <p  preserves  directed  edges  if  p>(x\ . . .  xn)  =  a± . . .  an-ian 
then  for  any  z  E  A,  p>(x2  . . .  xnz )  =  a2  . . .  anw  for  some  w  £  A.  By  defini¬ 
tion  of  ip',tp'(x i, . . .  x„,_i)  =  oi . . .  an_!  and  p>\x2  . . .  xn )  =  a2  . . .  an.  Thus  ip' 
preserves  the  directed  edge.  Thus  we  get  Aut (B(d,n))  Aut (B(d,n  —  1)). 

In  the  other  direction,  suppose  we  are  given  tp'  £  Aut(^(d,  n  —  1)).  Since 
ip'  preserves  directed  edges,  and  directed  edges  of  B{d,n  —  1)  are  precisely 
the  vertices  of  B(d ,  n) ,  <p'  defines  a  map  on  vertices  of  B(d,  n).  That  is,  (with 
some  abuse  of  notation) 

<p(x  1  ...Xn)  =  ip(xi  .  .  .  Xn-1  ->  X2  .  .  .  Xn) 

=  <p'(x  1 . . .  xn_x  ->x2...  xn) 

=  <p'(x!  .  .  .xn-l)  ->  <p'(x2  .  .  ,Xn). 

Thus,  given  <p'(x i . . .  xn-\)  =  a± . . .  an-\  then  p>\x2  . . .  xn)  =  a2  . .  ,an  for 
some  an  £  A  and  we  define  <p(x i . .  .xn)  =  a\ . . .  an.  Further,  since  p>'  pre- 
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serves  directed  2-paths,  <p  preserves  directed  edges.  Thus  we  get 

A ut (B(d,n  —  1))  <— >  Aut (B(d,n)). 

Since  the  automorphisms  of  B(d ,  2)  are  permutations  of  the  loops,  and  of 
the  symbols  of  A,  by  induction,  so  are  the  automorphisms  of  B(d,n)  for  all 
n.  Thus  we  have  proved  the  following. 

Theorem  4.12.  Aut (B(d,  n))  =  Sym(^4d)  for  all  n  >  2. 

Corollary  4.13.  Det (B(d,n))  = 

Proof.  Let  S'  be  a  minimum  set  of  vertices  in  which  each  letter  of  Ad~\  occurs 
at  least  once.  It  is  easy  to  see  that  |Sj  =  |~^— !-].  Any  permutation  of  Ad 
that  acts  nontrivially  on  any  letter  of  Ad  must  act  non-trivially  on  any  string 
containing  that  letter.  Thus  if  a  €  PtStab(S),  then  o  must  fix  every  letter 
contained  in  any  string  in  S.  Thus  a  fixes  0, 1, . . . ,  d  —  1  and  therefore  also  d. 
We  can  conclude  that  a  is  the  identity  in  both  Sym(Sld)  and  in  Ant (B(d,  n) 
and  therefore  S  is  a  determining  set.  Thus  Det (B(d,  n))  < 

Further  if  |Sj  <  |~^A]  then  fewer  than  d  —  1  letters  of  Ad  are  used  in 
strings  in  S.  If  a,  b  G  Ad  are  not  represented  in  S,  then  the  transposition  (a  b ) 
in  Sym(Arf)  is  a  non-trivial  automorphism  of  B(d,n)  that  fixes  S  pointwise. 
Thus  S  is  not  a  determining  set.  □ 

Thus  for  directed  de  Bruijn  graphs,  the  determining  number  and  the 
directed  metric  dimension  can  be  vastly  different  in  size. 

5  Future  Work 

There  are  several  directions  that  future  work  in  this  research  area  could  take. 
The  first  is  to  continue  the  research  on  identifying  codes  in  directed  de  Bruijn 
graphs.  One  key  result  missing  from  this  paper  is  the  determination  of  the 
size  of  a  minimum  t-identifying  code  in  the  graph  B(d,n )  when  t  —  2n  —  1. 
It  would  be  ideal  to  determine  both  a  formula  for  which  graphs  B(d,n )  are 
^-identifiable  for  a  given  t ,  as  well  as  constructions  when  it  is  known  that 
such  an  identifying  code  exists. 

An  alternative  direction  for  future  research  is  to  consider  these  same 
vertex  subsets  (identifying  codes,  dominating  sets,  resolving  sets,  and  de¬ 
termining  sets)  on  the  undirected  de  Bruijn  graph.  Little  work  has  been 
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done  and  even  foundational  results  like  the  size  of  a  minimum  dominating 
set  are  currently  missing.  Basic  Matlab  programs  have  shown  that  many 
more  undirected  de  Bruijn  graphs  are  t-identifiable  than  directed,  and  also 
that  the  minimum  size  of  an  identifying  code  is  much  smaller.  For  example, 
through  brute  force  testing  we  have  determined  that  the  minimum  size  of  a 
1-identifying  code  in  £>( 2,  5)  is  12,  whereas  in  the  directed  graph  B( 2,  5)  we 
have  shown  that  the  minimum  size  is  16. 

Finally,  variations  on  the  concept  of  identifying  code  would  be  useful  for 
real-world  applications.  For  example,  one  type  of  variation  known  as  a  k- 
robust  identifying  code  allows  for  up  to  k  sensor  (identifying  code  vertex) 
failures  without  disruption  of  the  identifying  code  properties. 
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